Arabic Nested Noun Compound Extraction Based on Linguistic Features and Statistical Measures
Similar resources
Collocational Translation Memory Extraction Based on Statistical and Linguistic Information
In this paper, we propose a new method for extracting bilingual collocations from a parallel corpus to provide phrasal translation memories. The method integrates statistical and linguistic information to achieve effective extraction of bilingual collocations. The linguistic information includes parts of speech, chunks, and clauses. The method involves first obtaining an extended list of Englis...
Automatic Arabic Text Summarization System Based on Semantic Features Extraction
Recently, one of the problems that has arisen due to the amount of information and its availability on the web is the increased need for an effective and powerful tool to summarize text automatically. For English and other European languages, intensive work has been done with high performance, and researchers now look toward multi-document and multi-language summarization. However, Arabic language still s...
Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic
Statistical studies based on automatic phonetic transcription of Standard Arabic texts are rare, and the studies that have been performed address only one level (phoneme or syllable), so their results cannot be generalized to the language as a whole. In this paper we automatically derived accurate statistical information about phonemes, allophones, syllables, and allosyllable...
Influence of accurate compound noun splitting on bilingual vocabulary extraction
The influence of compound noun splitting on a German-Polish bilingual vocabulary extraction task is investigated. To accomplish this, several unsupervised methods for increasingly accurate compound noun splitting are introduced. Bilingual evidence from a parallel German-Polish corpus and co-occurrence counts from the web are used to disambiguate compound noun analyses directly. These collected ...
Atrial Activity Extraction Based on Statistical and Spectral Features
Atrial fibrillation is the most common human arrhythmia. The analysis of the associated atrial activity provides features of clinical relevance. The atrial signal must first be extracted. We follow the semi-blind source extraction (S-BSE) approach to solve the problem. The proposed algorithm satisfies the prior knowledge about the atrial signal: its statistical properties and i...
Journal
Journal title: GEMA Online® Journal of Language Studies
Year: 2018
ISSN: 1675-8021,2550-2131
DOI: 10.17576/gema-2018-1802-07